Correlation Analysis of Acoustic Features with Perceptual Voice Quality Similarity for Similar Speaker Selection
نویسندگان
چکیده
This paper describes the correlations between various acoustic features and perceptual voice quality similarity. We focus on identifying the acoustic features that are correlated with voice quality similarity. First, a large-scale perceptual experiment using the voices of 62 speakers is conducted and perceptual similarity scores between each pair of speakers are acquired. Next, multiple linear regression analysis is carried out; it shows that five acoustic features exhibit high correlation to voice quality similarity. Last, we perform similar speaker selection based on multiple linear regression with the above features and moreover, assess its performance by classifying speakers based on the perceptual similarity. The results indicate that the combination of the five acoustic features in classifying speakers into two classes is effective in choosing speakers with similar voice quality; it reduces the error rate by about 44 % compared to using just the cepstrum.
منابع مشابه
Similar Speaker Selection Technique Based on Distance Metric Learning with Perceptual Voice Quality Similarity
This paper describes a similar speaker selection technique based on distance metric learning. Our aim is selection of a perceptually similar speaker using acoustic features from a multispeaker database. A novel point of the proposed technique is training a transform matrix using the perceptual voice quality similarity between many speakers obtained from a subjective evaluation to convert acoust...
متن کاملEuclidean Distances as measures of speaker similarity including identical twin pairs: A forensic investigation using source and filter voice characteristics
There is a growing consensus that hybrid approaches are necessary for successful speaker characterization in Forensic Speaker Comparison (FSC); hence this study explores the forensic potential of voice features combining source and filter characteristics. The former relate to the action of the vocal folds while the latter reflect the geometry of the speaker's vocal tract. This set of features h...
متن کاملThe effect of bilateral subthalamic nucleus deep brain stimulation (STN-DBS) on the acoustic and prosodic features in patients with Parkinson’s disease: A study protocol for the first trial on Iranian patients
Background: The effect of subthalamic nucleus deep brain stimulation (STN-DBS) on the voice features in Parkinson’s disease (PD) is controversial. No study has evaluated the voice features of PD underwent STN-DBS by the acoustic, perceptual, and patient-based assessments comprehensively. Furthermore, there is no study to investigate prosodic features before and after DBS in PD. The curren...
متن کاملطراحی یک روش آموزش ناموازی جدید برای تبدیل گفتار با عملکردی بهتر از آموزش موازی
Introduction: The art of voice mimicking by computers, has with the computer have been one of the most challenging topics of speech processing in recent years. The system of voice conversion has two sides. In one side, the speaker is the source that his or her voice has been changed for mimicking the target speaker’s voice (which is on the other side). Two methods of p...
متن کاملA relation between voice quality and acoustic features
. The present study deals with a voice quality that is referred to as “intelligible” or “articulate” voice. The purpose of the present study is to find a relation between the voice quality and acoustic features. The speech material is a read version of a simple short Japanese sentence “a o i so ra” in order to eliminate these linguistic variations. A set of speech samples uttered by fifteen adu...
متن کامل